Picture for Yiren Song

Yiren Song

PAI-Studio: Cinematic Video Background Replacement with Camera-Aware Motion

Add code
May 31, 2026
Viaarxiv icon

AnySurf: Any Surface Generation with Directed Edge

Add code
May 23, 2026
Viaarxiv icon

SWEET: Sparse World Modeling with Image Editing for Embodied Task Execution

Add code
May 19, 2026
Viaarxiv icon

OmniHumanoid: Streaming Cross-Embodiment Video Generation with Paired-Free Adaptation

Add code
May 12, 2026
Viaarxiv icon

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Add code
Apr 06, 2026
Viaarxiv icon

UENR-600K: A Large-Scale Physically Grounded Dataset for Nighttime Video Deraining

Add code
Apr 06, 2026
Viaarxiv icon

Unlocking the Latent Canvas: Eliciting and Benchmarking Symbolic Visual Expression in LLMs

Add code
Mar 15, 2026
Viaarxiv icon

SIGMA: Selective-Interleaved Generation with Multi-Attribute Tokens

Add code
Feb 07, 2026
Viaarxiv icon

Loom: Diffusion-Transformer for Interleaved Generation

Add code
Dec 20, 2025
Figure 1 for Loom: Diffusion-Transformer for Interleaved Generation
Figure 2 for Loom: Diffusion-Transformer for Interleaved Generation
Figure 3 for Loom: Diffusion-Transformer for Interleaved Generation
Figure 4 for Loom: Diffusion-Transformer for Interleaved Generation
Viaarxiv icon

Mitty: Diffusion-based Human-to-Robot Video Generation

Add code
Dec 19, 2025
Viaarxiv icon